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Abstract 



Bauschke, Borwein, and Lewis have stated a trichotomy theorem jH Theorem 
5.7.16] that characterizes when the convergence of the method of alternating 
projections can be arbitrarily slow. However, there are two errors in their 
proof of this theorem. In this note, we show that although one of the errors 
is critical, the theorem itself is correct. We give a different proof that uses 
the multiplicative form of the spectral theorem, and the theorem holds in 
any real or complex Hilbert space, not just in a real Hilbert space. 



1991 Mathematics Subject Classification: 41A65, 46C05, 46N10, 47H09. 
Key Words and Phrases: alternating projections, cyclic projections, orthogo- 
nal projections, angle between subspaces, rate of convergence of the method 
of alternating projections. 



1 Introduction 



For the notation and basic Hilbert space results necessary to read this paper, 
the book [6J is a good source, especially chapter 9. 

Let if be a (real or complex) Hilbert space with inner product (x, y) and 
norm ||x II = ^/J^. If M is any closed (linear) subspace of H, let Pm 
denote the orthogonal projection onto M. That is, Pm : H M is defined 

by 

\\x — PmMW = inf ||x — ull. 

Let Ml and M2 be closed subspaces in H and M := Mi n M2. It is well- 
known that PmiPm2 = Pm if and only if Pm^ and Pm2 commute: PmiPm2 = 
Pm2Pmi- Von Neumann established the following result which yields an 
interesting analogue in the non-commuting case. 

Theorem 1.1 (von Neumann | |13] ) For each x & H , there holds 

lim \\{Pm2PmT{x) - Pm{x)\\ = 0. (1.1) 

The method of constructing the sequence {Pm2PmiY{,x) by alternately 
projecting onto one subspace and then the other is called the method of 
alternating projections. While Von Neumann's theorem shows that the se- 
quence of iterates {PM2PNhY{.x), always converges to Pm{x) for every x, it 
does not say anything about the speed or rate of convergence. To say some- 
thing about this, we will use the notion of angle between subpaces. Recall 
that the (Friedrichs) angle between the subspaces Mi and M2 is defined to 
be the angle in [0, 7r/2] whose cosine is given by 

c(Mi,M2) := snp{\{x,y)\ \ x E MiH HBh, y E M2 H M^ n 5/^}, 

where Bh '■= {x G H \ \\x\\ < 1} is the unit ball in H. It is easy to see that 
< c(Mi,M2) < 1. 

Theorem 1.2 (Aronszajn |T]) For each x E H and n > 1, we have 

\\{PM2PAhnx) - Pm{x)\\ < c(Mi,M2)2"-lx||. (1.2) 

Kayalar and Weinert p2| showed that the constant in Aronszajn's theo- 
rem is smallest possible independent of x. More precisely, they proved that 

\\{Pm2PmX - PmW = c(Mi,M2)'"-^ for each n G N. (1.3) 

The usefulness of the bound in f 1 1.2 1) depends on knowing when the cosine 
of the angle between Mi and M2 is less than one, i.e., when the angle is 
positive. A useful characterization of when this happens is the following. 
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Lemma 1.3 c(Mi, M2) < I if and only if Mi + M2 is dosed. 

This lemma is a consequence of results of Deutsch [5] and Simonic, whose 
result appeared in p] Lemma 4.10] (see also [HI Theorem 9.35, p. 222]). 

Recall that a sequence is said to converge to x linearly provided 
there exists an a < 1 and a constant c such that 

||a^ri — < ca"' for each n > 1. 

In this case, we say that the rate of convergence is a. 

Using Lemma [131 and Theorem II. 2 ^ we see that there is linear convergence 
for the method of alternating projections whenever the sum of the subspaces 
is closed. What can be said when the sum is not closed? 

Franchetti and Light pUj gave the first example of a Hilbert space and two 
closed subspaces whose sum was not closed such that: given any sequence of 
reals decreasing to zero, there exists a point in the space with the property 
that the convergence in the von Neumann theorem was at least as slow as 
this sequence of reals. But this still left open the question of whether such a 
construction could be made in any Hilbert space whenever Mi and M2 were 
any closed subspaces whose sum was not closed. 

In their study of the method of alternating projections, Bauschke, Bor- 
wein, and Lewis |1] stated the following dichotomy. (Actually, they stated 
their result as a trichotomy since they were considering the more general set- 
ting of closed affine sets, i.e., translates of subspaces, rather than subspaces. 
In this situation, unlike the subspace case, one must also consider the pos- 
sibility that the intersection of the affine sets is empty. However, when the 
intersection is nonempty, the affine sets case easily reduces to the subspace 
case by a simple translation.) Roughly speaking, it states that in the method 
of alternating projections, either there is linear convergence for each starting 
point, or there exists a point which converges arbitrarily slowly. 

Theorem 1.4 (dichotomy) Let Mi and M2 be closed subspaces in a Hilbert 
space H and M = Mi fl M2. Then exactly one of the following alternatives 
holds. 

(1) Ml + M2 is closed. Then for each x E H , the sequence {Pm2Pmi)"'{x) 
converges linearly to Pm{x) with a rate [c(Mi,M2)]^. 

(2) M1+M2 is not closed. Then for each x E H , the sequence {Pm2Pmi)^{x) 
converges to Pm{x). But convergence is "arbitrarily slow" in the fol- 
lowing sense: for each sequence (A„) of positive real numbers with 
1 > Ai > A2 > • • ■ > A„ — s> 0, there exists a point xx E H such 
that 

\\{Pm,PmTM - PmMW > Xn for alln. 
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Remark Clearly, the first statement of Theorem 11.41 is an immediate conse- 
quence of Theorem 11.21 and Lemma 11.31 Thus we need only verify the second 
statement. We will do this in Section [3] below. 

2 Multiplicative form of the spectral theorem 

The main fact that we will use in the proof of Theorem 11.41 is the multi- 
plicative form of the spectral theorem (see Halmos yjj or Reed-Simon f!M 
Corollary on p. 227]). Recall that a bounded linear operator U : Hi ^ H2 
between Hilbert spaces Hi and H2 is called unitary if U is invertible and 
U* = U^^. It follows that a unitary operator is isometric: \\Ux\\ = \\x\\ for 
each X E H. Since the inverse of a unitary operator is unitary, it too is 
isometric. (We will use these facts in a few places below without explicit 
mention.) 

Theorem 2.1 (Spectral Theorem; multiplicative form) Let H be a 

(real or complex) Hilbert space, and let T be a self-adjoint bounded linear 
operator on H. Then there exists a finite measure space {fi,fJ,), a bounded 
real-valued function F on Q, and a unitary map U : H ^ L2(fi,/i) such that 

UTU~'f = F-f forallfeL2{Q,fi). (2.1) 

Defining D : L2(fi,yu) L2{fl,n) to be the operator "multiplication by F", 
{Df){t) := F{t)f{t), this can be expressed in operator notation as 

UTU-^ = D. (2.2) 

Actually, in both [TT] and [13], the theorem is stated for a complex Hilbert 
space only, and [H] even assumes separability. However, it is easy to check 
that each of the tools used in the proof in [TT], for example, has a corre- 
sponding real space analogue. 

Acknowledgements We are greatly indebted to Joel Anderson, Nigel Hig- 
son, and Barry Simon for personally transmitting some very useful comments 
to us related to the multiplicative form of the spectral theorem. 

A self-adjoint operator T on if is called positive if {Tx, x) > for each 
X E H. A simple, but important, example of a positive operator is the 
orthogonal projection P5 onto any closed subspace S C H (see, e.g., P, p. 
79]). 

Corollary 2.2 Assume the hypothesis of Theorem \2.1\ IfT is also positive, 
then the bounded real-valued function F of Theorem \2.1\ is also nonnegative 
a.e.(/i). 
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Proof. Let / G L2(fi,/i) be arbitrary and y = U ^f. Since T is positive, 
we have that 

/ = {FfJ) = {DfJ) = {UTU-'fJ) 

= {TU-'f,U*f) = {Ty,y)>0. 

Briefly, F\f\'^dfi > for each / e L2(fi, yu). We readily deduce that F > 
a.e.(/i). ■ 



3 Proof of Theorem 11.41 



In this section we will prove the second statement of Theorem II .41 Our proof 
is along the same general lines as in [Ij in that we proceed by a series of small 
steps that are each easily digested. However, there are subtle errors in steps 
2 and 3 of [4] (see Section H] for the details). We will avoid these errors by 
using Theorem 12.11 and following a somewhat different path. 

Proof of the second statement in Theorem 11.41 Suppose Mi + M2 
is not closed, and let (A„) be a sequence with 1 > Ai > A2 > ■ ■ • > A„ > 0, 
and A„ 0. By LemmaOl c(Mi,M2) = 1. Let 

A = MiHM^ and 5 = M2 n M^. (3.1) 

Note that A and B are closed subspaces with An B = {0}. Clearly, 

c(A,5) = c(Mi,M2) = 1 (3.2) 

and hence, by Lemma [T!^ again. A+B is not closed. Since c{A, B) = \\PbPa\\ 
by [3] (see also [HI Lemma 9.5(7), p. 197]), it follows that UPeP^II = 1- 

Lemma 3.1 The operator T := P^PePa is a bounded self-adjoint linear 
operator on H which is positive and \\T\\ = 1. Hence there exists a finite 
measure space a nonnegative bounded function F on Q, and a unitary 

operator U : H —> L2 := L2{fl,fi) such that 

UTU'^ = D, (3.3) 

where D : L2 —>■ L2 is defined by Df := Ff for each / G L2. 

Proof of Lemma l3J\ By Corollary 12. 2^ it suffices to verify the first state- 
ment of the lemma. Clearly, T is self-adjoint and bounded. Moreover, using 
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P Corollary 5.17], ||T|| = \\PaPbPa\\ = \\PbPaV = 1- Fix any x e H and 
set y = Pax. Since Pb is positive, we have that 

{Tx,x) = {PaPbPax,x) = {PbPax,Pax) = {PBV.y) > 0. 

This shows that T is positive on H and completes the proof of Lemma 13.11 

For each k G N := {1,2,...}, let be the largest integer such that 
SfcA/fc < 1. Then the following claim is clear. 

Claim 1. SfcAfc < 1 < (s^ + l)Afc for aii /c G N, si < S2 < S3 < ■ ■ ■ , and 
each Sk occurs only finitely often. 

Next let {in) be the strictly increasing sequence of integers with 

{t,,t2,...} = {Si,S2,...}. (3.4) 

Note that since is a subsequence of (n), it follows that 

00 ^ 

5^-<oo. (3.5) 
1 

For each n G N, we define 

ko{n) := min{fc | Sk = tn} and ki{n) := maxj/c | Sk = tn}- (3.6) 
It is clear that ko{n) 00, ki{n) 00, and 

'Sfco(n.)-l = tn-l < tn = SkQ{n) = Sko{n)+l = ■ ■ ■ = Sfc^(„,) < tn+1 = Ski{n)+1- 

(3.7) 

Set 

ttn := (Afco(n)tn)^ for cach neN. (3.8) 
Claim 2. For eaci n G N, 

1 > AfcQ(„)Sfco(„) = Xko{n)tn > 1 — Xko{n), (3.9) 

< ttri < 1, and a„ — >• 1. (3.10) 

To see this, note that by definition, Xko{n)tn = Xko{n)Skoin) < 1, and 
1 < Xko{n){sko{n) + !)• But the latter inequality implies that 1 — \ko(n) < 
Afco(n)Sfco(n) = Afco(„)t„. Also, \ko{n)tn < 1 implies that a„ < 1. Since 
'^fco(n) 0, relation fl3.9p implies that Xko(n)tn 1- This, along with 
ki{n) —>■ oo, shows that a„ ^ 1, which completes the proof of Claim 2. 
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We note that the first two claims follow exactly as in the proof given in 
[1]. However, at this point our approach will deviate significantly from that 
of ID- 
Claim 3. fi{F'\[l,oo))} = 0. 

To see this, let S := oo) and y = U^^{xs), where xs denotes the 

characteristic function of S: xs(t) = I if t & S and otherwise. We must 
show that fi{S) = 0. Since 

\\y\\ = \\U-\xs)\\ = WxsW = Irf/i) ^ = (3.11) 

it suffices to show that y = 0. Using (13.111) . we have 

\\Ty\\ = \\U-'DUy\\ = \\U-'Dixs)\\ = \\U-\Fxs)\\ = \\Fxs\\ 



> 



\y\\- 



(3.12) 



This shows that \\Ty\\ > \\y\\. But since T = PaPrPa is the product of 
norm one operators, \\Ty\\ < \\y\\. Thus \\Ty\\ = \\y\\. We deduce that 



\y\\ = WPAPsPAyW < \\PBPAy\\ < \\PAy\\ < \\yl 



(3.13) 



Thus we must have equality holding throughout the string of inequalities 
(I37[3|) . It follows (see, e.g., P Theorem 5.8(2), p. 76]) that y e AnB = {0} 
and hence y = 0. This proves Claim 3. 

Claim 4. For each e > 0, fi{F-\{l - e, 1))} > 0. 

If not, there exists e > such that /i{F^^((l — e, 1))} = 0. Choose any 
y E H and set g = Uy. Then, using Claim 3, we have that 



\\Ty\\'' 



U-'DUyf = \\DUyr 
F'\g\'dfi + 



'F-i([0,l-e]) 



\Dgf = j \Fg\'dfi 
F^\g\^dfi + 



F^\g\'^dfi 
F^\g\^dix 



\g\'^di2 + + 0<{l-ef / \g\'^dfi 



F-i([0,l-£]) 



[i-erWyf. 



Briefly, ||Ty|| < (1 — £:) for each y E H. It follows that ||T|| < l — e, which 
(by Lemma [3. II) contradicts ||T|| = 1. This proves Claim 4. 
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To verify this, we use Claim 4 and the countable additivity of // to obtain 



This proves Claim 5. 

Claim 6. There exists a sequence of reals {j3n) C (0, 1) such that < 
Pn < Pn+1 < 1 and ii{F-^ ([/?„, pn+i))} > for each n e N. 

We prove Claim 6 by induction. For n = 1, take Pi = of. Then (3i < 1. 
Assume next that f3i,...,f3rn have been chosen so that f3i < (32 < ■ ■ ■ < 
I3m < I, I3k > Oil for A; = 1,2,..., m, and /^{P-^ {[jSk, Pk+i)) > for A; = 
1, 2, . . . , m — 1. Let e :— min{l — a^+i, 1 — Pm}- Then £ > and Claim 
5 implies the existence of £i e (0, e) such that //{-F"^ ((1 — £, 1 — £i))} > 0. 
Let Prn+i := 1-Si. Then Prn+i > l-e > 0rn- Also, Prn+1 > I- e > 
Finally, fi{F-^ {[Pm, Prr.+i))} > f^{F-' ([1 - £, 1 - £i))} > 0. This completes 
the induction step and hence the proof. 

Definition 3.2 With f3n given as in Claim 6, for each n e let Sn '■— 
and define the vector Cn & H by 




Thus there exists an integer i such that /jl {F ^ ((1 — |, 1 — ^])} > 0. Let 
£i = j^. Then ei e (0, e) and 



{F-' ((1 -e,l- El))} > [f ' ((1 - J, 1 - eO) } > 0. 



1 



■n 



Note that 



1 



Ue. 



■n 



XSn • 



Claim 7. lie 



■n 



1 for each n e N. 



This follows from 



It is convenient to list next a few basic and easily verified facts concerning 
powers of T and D. 

Claim 8. 

(1) = {U-^DUf = U-^D'^U. 

(2) D^f = F^f iorall f eL2{n,^i). 

(3) Iff, g e L2{n, fj,) and f{t)g{t) = for fx almost all t, tlien {D^f, D'^g) = 
for every j,k eNU {0}. 

Claim 9. For all integers j,k E NU {0} and m,n E N with m n, we 
have 

(T^e^,T'=e„) = 0. 



To verify this, let := f/e^ = -^i=xs, for each r G N. Then Xs„X5„ = 

Xs„nSm = since SnCi Sm = 0. Thus fnfm = 0. Using statements (1) and 
(3) of Claim 8, we get that 

Claim 10. (3^^^^ > \\T''eJ > Pn > for all k,neN. 
The last inequality follows from Claim 6. Next observe that 

2 



\U-^D^Ue. 



Also, by the definition of 5"^ (in Definition 13.21) . it is clear that 

1 



/3f < 



2k 



dfi 



Taking square roots completes the proof of Claim 10. 

Now we can define the element which will converge slower than the se- 
quence (A„). 
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Definition 3.3 Set 



oo ^ 



Since < Y^^l"^ < ^ ^^'^ ll^nll — follows that x\ is a 

well-defined element of H. 

Claim 11. IIT'^xaII > /t„ for aii n, A; G N. 

We deduce 

n m 

= E^ll^'^^ll' (by Claim 9) 



n 



> — ||T^e„||^ for each n 



ik 

> \ (by Claim 10). 

Thus ||T^xa|| > a'^ /tn as claimed. 

Claim 12. || (PBP4)^a;A|| > Afc for eacii A; G N. 

Fix any G N and choose n G N such that ko{n) < k < ki(n). Using 
Claim 11, we get that 

2k 2fci(n) 

\\{PBPAfxx\\ > \\Pa{PbPa)'xx\\ = WT'^X.W >-^> = Xk,in) > A,, 

which proves Claim 12. 

Claim 13. For each keN, {Pm^Pm,Y - Pm = {PbPaY ■ 

Using the facts that M = MinM2, Pm-^ = I—Pm-, and Pm^ is idempotent 
and commutes with both Pm^ and Pm2 (see, e.g., [6j p. 194]), we get that 
Pm,Pm^ = PAhnM^ for z = 1, 2 and 

{PM2PM1Y ~ Pm = {PM'zPMiYil — Pm) = {Pm2Pmi)^Pm^ 

= {Pm2Pm^PmxPm^)^ — {PM2r\M^PMinM^)^ 
= {PbPa)\ 
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which proves Claim 13. 

Combining Claims 12 and 13, we immediately obtain 

Claim 14. \\{PM,PM,f{xx) - PmMW > \u for each A; e N. 

This completes the proof of the second statement of Theorem ll.4[ 



4 Two errors in 114 



In this section, we point out two errors in [Ij. We shall use the notation of 
[1]. (Note that this is the same as the notation of the present paper except 
that here we have used Mi, M2 instead of Ci, C2.) 

First error. The proof of the Claim in Step 2 of the proof of Theo- 
rem 5.7.16 in |4J has a mistake. The Claim itself is correct, only the proof of 
this claim is incorrect. 

Specifically, we inductively construct (e^) and (/^) in A and 5, respec- 
tively. Let E and F be the finite-dimensional spaces as in the proof. Let 
{an) in A and in B as in the proof: 

||a„|| = 1 = ||6n|| and (a„,6„)^l, (4.1) 

and — i> weakly and 6„ — weakly. Because E + F is finite- dimensional, 
the sum A-^ + {E + F) is closed. Hence {A-^, E + F} is regular (by [SI 
Proposition 5.16]) and so is {A^^, {E + F)^} = {A, E^ n F^} (again by P 
Proposition 5.16]). This means the following by definition of regularity. 

Observation. If {zn) is a bounded sequence with 
max{d{zn,A),d{zn,E^ n F^)} 0, then d{zn,An E^ n F^) 0. (And 
analogously when A is replaced by B.) 

Now back to the proof of the Claim. This time, Pe+f is a compact 
operator. (In Pj, Pe and Pp were considered, which is not sufficient.) Since 
a„ — s> weakly and 6„ — * weakly, we deduce that 

PE+pan and PE+pbu ^ 0. (4.2) 

Since {E + F)-^ = E-^ n F-^ , this implies 

an - PE^nF^an and 6„ - Pe^hf^K -> 0; (4.3) 

equivalently, 

d{an, E^ n F^) ^ and d{bn, E^ n F^) ^ 0. (4.4) 
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The above Observation now implies (i(a„, A fl E-^ fl F^) — > and d{bn, B fl 
n F^) 0; equivalently, 

«n — PAnE^nF^(^n — ^ and 6„ — PsnE^nF^bn ^0. (4.5) 

In view of f l4.ip . we deduce that 

(-PAnE^nF-Lan, -PenB-LnF-L^n) ^ 1- (4.6) 

Thus, for all n sufficiently large, we have H-PAns-LnF^'^nll < 1; ll-PenF^nF^^nll ^ 
1, ^Ani?^nFi«n e A n n F^, PBnE^nF^bn e B f] E^ D F^ , and 
{PAnE^nF^CLn, PsnE-^nF-^bn) is as close to 1 (from below) as we like. Then for 
n sufficiently large, we can take e'^_^i = Pj^f^E^nF^(^n and f^+i = PBnE^nF^^n- 

Second error. The second error is on the third line on page 32 of [1], 
where it is claimed that 

Ci = {CinC2)®E®{AnE^nF^), C2 = {Cir\C2)®F®{Br\E^nF^). (4.7) 

Unfortunately, only 

Ci = (Ci n C2) © F © (A n E^), C2 = (Ci n C2) © F © (s n f^) 

is true. This invalidates the rest of the proof in |3]. 

Here is a counterexample to (14. 7p . Let {un \ n G N} be an orthonormal 
basis of a separable Hilbert space. Set 

Ci := span{M2„ + ^M2n-i I n e N} and C2 := span{M2n + ^M2n+i | n G N}. 
Then 

CinC2 = {0}. (4.8) 

(Sketch: the spanning vectors are orthogonal. Normalize and use Fourier 
expansions. Equate coefficients, compare odd and even ones. Deduce that 
they are all equal; thus they must be equal to 0.) Hence A = Ci and B = C2- 
Set 

en = e'„ = pn{u4n + ^M4n-l) and fn = fn = Pn {uin + ^M4„+l) , (4.9) 

where p„ := (1 + 4^)"^^^ Since {e'^J^) = (1 + ^)~\ the sequences (e^) 
and (/^) are as in the Claim of Step 2, and the sequences (e„) and (/„) are 
as in Step 3. Set 

F = span{e„ I n G N} and F = span{/„ | n G N}. (4.10) 
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Then 

E + F = span{2ne4„ + u^n-i, '^ne^n + W4„+i | n G N} (4-11) 

is a subspace of span{u4n-i, u^n, u^n+i I n ^ P^}- Thus {ui, U2, u^, Uio, . . .} C 
{E + F)-^. Since the orthogonal complement of span{2ne4„ + u^n-i, 2ne4„ + 
U4n+i I n G N} in span{M4„_i, ■U4„, 'U4„+i | n G N} is span{— 2n-U4„_i + u^n — 
2nu^n+i I 7^ G N}, we obtain 

E^nF^ = (E + F)^ (4.12) 
= span{Mi, M4n-2) " 2n-U4„_i + u^n — 2nM4„_|_i I n G N}. 

Consider the vector x := Uq + ^u^. Then x belongs to Ci = A. Since 
E C span{u4„_i, U4„ | n G N}, it follows that x G E-^ and hence PeX = 0. 
Now consider the first term in the false statement (14. 7p . which in our present 
situation becomes 

A = E®{AnE^nF^). (4.13) 

This would imply that x belongs entirely to A fl E-^ fl F-^. While it is true 
that X E A n -E"*", it is not true that x belongs to E-^ fl F-^. This can be 
verified using relation fl4.12p . 
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